Description

The present invention relates in general to methods 
and systems for permitting backup copying of datasets 
in external storage associated with accessing data 
processing systems, and in particular the present inven- 
tion relates to backup copying of datasets in external 
storage utilizing sidefile storage of updated portions of 
the designated datasets. 

Still more particularly, the present invention relates 
to a method and system for automatic sidefile polling in 
a data processing system during a time zero backup 
copying operation. 

A modern data processing system must be prepared
to recover, not only from corruptions of stored data
which occur as a result of noise bursts, software bugs,
media defects, and write path errors; but also from glo- 
bal events, such as data processing system power fail- 
ure. The most common technique of ensuring the con- 
tinued availability of data within a data processing sys- 
tem is to create one or more copies of selected datasets 
within a data processing system and store those copies 
in a nonvolatile environment. This so-called "backup" 
process occurs within state-of-the-art external storage 
systems in modern data processing systems.

Backup policies are implemented as a matter of 
scheduling. Backup policies have a space and time di- 
mension which is exemplified by a range of datasets and 
by the frequency of backup occurrence. A FULL backup 
requires the backup of an entire range of a dataset, 
whether individual portions of that dataset have been 
updated or not. An INCREMENTAL backup copies only 
that portion of the dataset which has been updated since 
a previous backup, either full or incremental. The back- 
up copy thus created represents a consistent view of the 
data within the dataset as of the time the copy was cre- 
ated. 
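
By way of illustration only, the distinction between the two policies can be sketched as follows. The dictionary model of a dataset (track number mapped to contents) and the function names are assumptions made for this sketch and do not appear in the cited art.

```python
from typing import Dict, Set


def full_backup(dataset: Dict[int, bytes]) -> Dict[int, bytes]:
    """FULL policy: copy every portion of the dataset, updated or not."""
    return dict(dataset)


def incremental_backup(dataset: Dict[int, bytes],
                       updated: Set[int]) -> Dict[int, bytes]:
    """INCREMENTAL policy: copy only portions updated since the previous backup."""
    return {track: dataset[track] for track in sorted(updated) if track in dataset}


# Hypothetical three-track dataset in which only track 1 changed.
dataset = {0: b"a", 1: b"b", 2: b"c"}
updated_since_last_backup = {1}

full = full_backup(dataset)
incr = incremental_backup(dataset, updated_since_last_backup)
```

Here `full` holds all three tracks, while `incr` holds only track 1.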

Of course, those skilled in the art will appreciate that 
as a result of the process described above, the higher 
the backup frequency, the more accurately the backup 
copy will mirror the current state of data within a dataset. 
In view of the large volumes of data maintained within
a typical state-of-the-art data processing system, backing
up that data is not a trivial operation. Thus, the opportunity
cost of backing up data within a dataset may
be quite high on a large multiprocessing, multiprogram- 
ming facility, relative to other types of processing. 

Applications executed within a data processing sys- 
tem are typically executed in either a batch (streamed) 
or interactive (transactional) mode. In a batch mode, 
usually one application at a time executes without interruption.
Interactive mode is characterized by an interrupt-driven
multiplicity of applications or transactions.

When a data processing system is in the process 
of backing up data in either a streamed or batch mode 
system, each process, task or application within the data 
processing system is affected. That is, the processes 
supporting streamed or batch mode operations are suspended
for the duration of the copying. Those skilled in
the art will recognize that this event is typically referred
to as a "backup window." In contrast to batch mode operations,
log based or transaction management applications
are processed in the interactive mode. Such
transaction management applications eliminate the
"backup window" by concurrently updating an on-line
dataset and logging the change. However, this type of
backup copying results in a consistency described as
"fuzzy." That is, the backup copy is not a precise "snapshot"
of the state of a dataset/data base at a single point
in time. Rather, a log comprises an event file requiring
further processing against the database.

European Patent Application No. 90307839.2, corresponding
to EP-A-0 410 630, illustrates backup in a
batch mode system utilizing a modified incremental policy.
A modified incremental policy copies only new data
or data updates since the last backup. It should be noted
that execution of applications within the data processing
system is suspended during copying in this system.

As described above, to establish a prior point of 
consistency in a log based system, it is necessary to 
"repeat history" by replaying the log from the last check 
point over the datasets or database of interest. The distinction
between batch mode and log based backup is
that the backup copy is consistent and speaks as of the 
time of its last recordation, whereas the log and data- 
base mode require further processing in the event of a 
fault, in order to exhibit a point in time consistency. 

United States Patent No. 4,507,751, Gawlick et al.,
entitled Method and Apparatus for Logging Journal Data
Using a Write Ahead Dataset, issued March 25, 1985,
exemplifies a transaction management system wherein
all transactions are recorded on a log on a write-ahead
dataset basis. As described within this patent, a unit of
work is first recorded on the backup medium (log) and
then written to its external storage address.

Co-pending United States Patent Application Serial
No. 07/524,206, filed May 16, 1990, entitled Method and
Apparatus for Executing Critical Disk Access Commands,
teaches the performance of media maintenance
on selected portions of a tracked cyclic operable magnetic
media concurrent with active access to other portions
of the storage media. The method described therein
requires the phased movement of customer data from
a target track to an alternate track, diversion of
all concurrent access requests to the alternate track or
tracks and the completion of maintenance and copy
back from the alternate to the target track.

Requests and interrupts which occur prior to executing
track-to-track customer data movement result in
the restarting of the process. Otherwise, requests and
interrupts occurring during execution of the data movement
view a DEVICE BUSY state. This typically causes
a requeueing of the request.

The invention as claimed is intended to overcome 
these drawbacks. 

It is therefore one object of the present invention to
provide an improved method and system for backup
copying of datasets in external storage associated with
accessing data processing systems.

It is another object of the present invention to provide
an improved method and system for backup copying
of designated datasets in external storage utilizing
sidefile storage of updated portions of the designated
datasets.

It is yet another object of the present invention to
provide an improved method and system for automatic
sidefile status polling in a data processing system during
a time zero backup copying operation.

The invention as claimed relates to a method and
system for enhanced efficiency of backup copying of
designated datasets stored within a plurality of storage
devices coupled to the data processing system via a
storage subsystem control unit having subsystem memory
therein. Application execution within the data
processing system is temporarily suspended long
enough to form a dataset logical-to-physical system address
concordance to be utilized to administer copying
of the designated dataset. Thereafter, application initiated
updates to uncopied portions of the designated datasets
are temporarily deferred until sidefiles of the affected
portions of the designated datasets are written to
subsystem memory. The updates are then written to the
storage subsystem. Portions of the designated datasets
are then accessed and copied from the storage subsystem
on a scheduled or opportunistic basis utilizing selected
data retrieval command sequences. A sidefile
status query is appended to selected data retrieval command
sequences and the presence of data within the
subsystem memory is determined without the necessity
of additional communications between the data
processing system and the storage subsystem. The
sidefiles are then accessed and copied. Copied portions
of the designated datasets and sidefiles are then copied
to alternate storage locations in a backup copy order defined
by the address concordance.

The present application is related to the European
Patent Application No. PCT/EP92/02127, corresponding
to WO-A-93/08529 entitled Method and Means for
Time Zero Backup Copying of Data, filed September 16,
1992, and assigned to the assignee herein named.

The novel features believed characteristic of the invention
are set forth in the appended claims. The invention
itself however, as well as a preferred mode of use,
further objects and advantages thereof, will best be understood
by reference to the following detailed description
of an illustrative embodiment when read in conjunction
with the accompanying drawings, wherein:

Figure 1 depicts a typical multiprocessing, multiprogramming
environment according to the prior art
where executing processors and applications randomly
or sequentially access data from external
storage;

Figures 2A-2B depict time line illustrations of the
backup window in a batch or streaming process in
the prior art and in a time zero backup system;

Figure 3 illustrates a conceptual flow of a time zero
backup copy in accordance with the method and
system of the present invention;

Figure 4 is a high level flowchart illustrating initialization
of a time zero backup copy in accordance
with the method and system of the present invention;

Figure 5 is a high level logic flowchart illustrating
backup copying in accordance with the method and
system of the present invention; and

Figure 6 is a high level logic flowchart illustrating
automatic sidefile polling in accordance with the
method and system of the present invention.

With reference now to the figures and in particular 
with reference to Figure 1, there is depicted a multiprocessing,
multiprogramming data processing system according
to the prior art. Such systems typically include
a plurality of processors 1 and 3 which access external 
storage units 21, 23, 25, 27, and 29 over redundant 
channel demand/response interfaces 5, 7 and 9. 

In the embodiment illustrated in Figure 1, each processor
within the data processing system may be implemented
utilizing an IBM/360 or 370
architected processor type having, as an example, an
IBM MVS operating system. An IBM/360 architected
processor is fully described in Amdahl et al., U.S. Patent
No. 3,400,371, entitled Data Processing System, issued
on September 3, 1968. A configuration in which multiple
processors share access to external storage units is set
forth in Luiz et al., U.S. Patent No. 4,207,609, entitled
Path Independent Device Reservation and Reconnection
in a Multi-CPU and Shared Device Access System,
issued January 10, 1980.

The MVS operating system is also described in IBM
Publication GC28-1150, entitled MVS/Extended Architecture
System Programming Library: System Macros
and Facilities, Vol. 1. Details of standard MVS or other
operating system services, such as local lock management,
subsystem invocation by interrupt or monitor, and
the posting and waiting of tasks, are omitted. These operating
system services are believed to be well known to
those having skill in this art.

Still referring to Figure 1, as described in U.S. Patent
No. 4,207,609, a processor process may establish a
path to externally stored data in an IBM System 370 or 
similar system through an MVS or other known operat- 
ing system by invoking a START I/O, transferring control 
to a channel subsystem which reserves a path to the 
data over which transfers are made. Typically, executing 
applications have data dependencies and may briefly 
suspend operations until a fetch or update has been 
completed. During such a transfer, the path is locked 
until the transfer is completed. 

Referring now to Figures 2A-2B, there are depicted 
time lines illustrating the backup window in a batch or 
streaming process in the prior art and in a time zero
backup system. As illustrated at Figure 2A, multiple 
backup operations have occurred, as indicated at back- 
up windows 41 and 43. Application processing is typi- 
cally suspended or shut down just prior to each backup 
window and this suspension will persist until the backup 
process has been completed. Termination of the backup 
window signifies completion of the backup process and 
commitment. By "completion" what is meant is that all 
data that was to have been copied was in fact read from 
the source. By "commitment" what is meant is that all 
data to be copied was in fact written to an alternate stor- 
age location. 

European Patent Application EP-A-399560 entitled
"Method of producing a duplication of a database" discloses
a method for reducing the time required for writing
the differential copy data and update log data over
the copy data, by making better use of the mechanical
characteristics of the external storage. However, with a
traditional backup system, even when the time required
for physical backup is reduced as much as possible,
application execution must be suspended during the whole
backup window.

Referring now to Figure 2B, backup windows for a 
time zero backup copy system are depicted. As de- 
scribed in detail within the co-pending cross-referenced 
patent application, each backup window 45 and 47 still 
requires the suspension or termination of application 
processing; however, the suspension or termination oc- 
curs only for a very short period of time. As described 
in the cross-referenced application, the time zero back- 
up method begins, effectively freezing data within the 
datasets to be backed up at that point in time. Thereaf- 
ter, a bit map is created identifying each track within the 
datasets to be backed up and after creation of that bit 
map, the copy is said to be "logically complete". The 
committed state, or "physically complete" state will not 
occur until some time later. However, at the "logically 
complete" point in time, the data is completely usable 
by applications within the data processing system. The 
time during which application processing is suspended 
in such a system is generally in the low sub-second
range; however, those skilled in the art will appreciate 
that the amount of time required to create a bit map to 
the data to be copied will depend upon the amount of 
data within the datasets. 

Of course, those skilled in the art will appreciate that 
if the time zero backup process terminates abnormally 
between the point of logical completion and the point of 
physical completion, the backup copy is no longer useful 
and the process must be restarted. In this respect, the 
time zero backup process is vulnerable in a manner very 
similar to that of backup systems in the prior art. That 
is, all backup operations must be rerun if the process 
terminates abnormally prior to completion. 

With reference now to Figure 3, there is depicted a
conceptual flow of the creation of a time zero backup
copy in accordance with the method and system of the
present invention. As illustrated, a time zero backup
copy of data within a tracked cyclic storage device 61
may be created. As those skilled in the art will appreciate,
data stored within such a device is typically organized
into records and datasets. The real address of data
within external storage is generally expressed in terms
of Direct Access Storage Device (DASD) volumes, cylinders
and tracks. The virtual address of such data is
generally couched in terms of base addresses and offsets
and/or extents from such base addresses.

Further, a record may be of the count-key-data format.
A record may occupy one or more units of real storage.
A "dataset" is a logical collection of multiple records
which may be stored on contiguous units of real storage
or which may be dispersed. Therefore, those skilled in
the art will appreciate that if backup copies are created
at the dataset level it will be necessary to perform multiple
sorts to form inverted indices into real storage. For
purposes of explanation of this invention, backup
processing will be described as managed both at the
resource manager level within a data processing system
and at the storage control unit level.

As described above, each processor typically includes
an operating system which includes a resource
manager component. Typically, an IBM System 370 type
processor running under the MVS operating system will
include a resource manager of the Data Facilities Data
Set Services (DFDSS) type which is described in U.S.
Patent No. 4,855,907, Ferro et al., issued August 8,
1989, entitled Method for Moving VSAM Base Clusters
While Maintaining Alternate Indices Into the Cluster.
DFDSS is also described in IBM Publication
GC26-4388, entitled Data Facility Dataset Services: User's
Guide. Thus, a resource manager 63 is utilized in
conjunction with a storage control unit 65 to create an
incremental backup copy of designated datasets stored
within tracked cyclic storage device 61.

As will be described below, the backup copy process
includes an initialization period during which datasets
are sorted, one or more bit maps are created and
logical completion of the bit map is signaled to the invoking
process at the processor. The listed or identified
datasets are then sorted according to access path elements
down to DASD track granularity. Next, bit maps
are constructed which correlate the dataset and the access
path insofar as any one of them is included or excluded
from a given copy session. Lastly, resource manager
63 signals logical completion, indicating that updates
will be processed against the dataset only after a
short delay until such time as physical completion occurs.

Following initialization, resource manager 63 begins
reading the tracks of data which have been requested.
This is accomplished by utilizing a unique control
block within the data processing system which identifies
a particular storage device, in association with a data
retrieval command sequence which identifies specific
data to be read. While a copy session is active, each
storage control unit monitors all updates to the dataset.
If an update is received from another application 67,
storage control unit 65 will execute a predetermined algorithm
to process that update, as described below.

In a time zero backup copy system a determination 
is first made as to whether or not the update attempted 
by application 67 is for a volume which is not within the 
current copy session. If the volume is not within the current
copy session, the update completes normally. Alternatively,
if the update is for a volume which is part of
the copy session, the primary session bit map is 
checked to see if that track is protected. If the corre- 
sponding bit within the bit map is off, indicating the track 
is not currently within a copy session, the update com- 
pletes normally. However, if the track is protected (the 
corresponding bit within the bit map is on) the track in 
question is part of the copy session and has not as yet 
been read by the resource manager 63. In such a case, 
storage control unit 65 temporarily buffers or defers the 
update and writes a copy of the affected track into a 
memory 66 within storage control unit 65. Thereafter, 
the update is permitted to complete. 
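
By way of illustration only, the update-handling decision described above can be sketched as follows. The class name `ControlUnit`, the (volume, track) addressing tuple and the returned status strings are assumptions made for this sketch; the set `protected` stands in for the bits that are "on" in the primary session bit map, and the dictionary `sidefile` stands in for memory 66.

```python
from dataclasses import dataclass, field
from typing import Dict, Set, Tuple

Track = Tuple[str, int]  # (volume, track number); hypothetical addressing scheme


@dataclass
class ControlUnit:
    storage: Dict[Track, bytes]                                 # stands in for device 61
    session_volumes: Set[str] = field(default_factory=set)     # volumes in the copy session
    protected: Set[Track] = field(default_factory=set)         # bit "on" in the session bit map
    sidefile: Dict[Track, bytes] = field(default_factory=dict)  # stands in for memory 66

    def write(self, track: Track, data: bytes) -> str:
        volume, _ = track
        if volume not in self.session_volumes:
            self.storage[track] = data
            return "normal: volume outside session"
        if track not in self.protected:
            self.storage[track] = data
            return "normal: track not protected"
        # Track is protected and not yet read: defer the update, preserve
        # the time zero image in the sidefile, then let the update complete.
        self.sidefile[track] = self.storage[track]
        self.protected.discard(track)
        self.storage[track] = data
        return "sidefile written, then update applied"


cu = ControlUnit(
    storage={("VOL1", 3): b"old3", ("VOL1", 4): b"old4", ("VOL2", 1): b"x"},
    session_volumes={"VOL1"},
    protected={("VOL1", 3)},
)
r1 = cu.write(("VOL2", 1), b"y")       # volume not in the session
r2 = cu.write(("VOL1", 4), b"new4")    # in the session, but bit is off
r3 = cu.write(("VOL1", 3), b"new3")    # protected: sidefile first
r4 = cu.write(("VOL1", 3), b"newer3")  # already preserved: completes normally
```

Only the first update to a protected track incurs the sidefile write; the time zero image of track 3 remains `b"old3"` in the sidefile however many later updates arrive.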

Thus, as illustrated in Figure 3, an update initiated 
by application 67 may be processed through storage 
control unit 65 to update data at tracks 3 and 5 within 
tracked cyclic storage device 61. Prior to permitting the
update to occur, tracks 3 and 5 are written as sidefiles 
to memory 66 within storage control unit 65 and there- 
after, the update is permitted to complete. The primary 
bit map is then altered to indicate that the copies of 
tracks 3 and 5, as those tracks existed at the time a 
backup copy was requested, are no longer within 
tracked cyclic storage device 61 but now reside within 
memory 66 within storage control unit 65. 

A merged copy, representing the designated data- 
set as of the time a backup copy was requested, is then 
created at reference numeral 69, by copying non-updat- 
ed tracks directly from tracked cyclic storage device 61 
through resource manager 63, or by indirectly copying 
those tracks from tracked cyclic storage device 61 to a 
temporary host sidefile 71, which may be created within
the expanded memory store of a host processor. Addi- 
tionally, tracks within the dataset which have been writ- 
ten as sidefiles to memory 66 within storage control unit 
65 prior to completion of an update may also be indirect- 
ly read from memory 66 within storage control unit 65 to 
the temporary host sidefile 71. Those skilled in the art
will appreciate that in this manner a copy of a designated 
dataset may be created from unaltered tracks within 
tracked cyclic storage device 61, from preupdated 
tracks stored within memory 66 of storage control unit 
65 and thereafter transferred to temporary host sidefile 
71, wherein these portions of the designated dataset 
may be merged in backup copy order, utilizing the bit 
map which was created at the time the backup copy was 
initiated. 
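
By way of illustration only, the merge described above can be sketched as follows. The function name `merge_backup` and the use of a plain list of track numbers as the backup copy order are assumptions made for this sketch; the source derives that order from the bit map created when the backup copy was initiated.

```python
from typing import Dict, List


def merge_backup(order: List[int],
                 device: Dict[int, bytes],
                 sidefile: Dict[int, bytes]) -> List[bytes]:
    """Assemble the time zero image in backup copy order.

    A track preserved in a sidefile takes precedence over the (since
    updated) track on the device; unaltered tracks come from the device.
    """
    return [sidefile[t] if t in sidefile else device[t] for t in order]


# Tracks 3 and 5 were updated after time zero, as in the Figure 3 example.
device = {1: b"t1", 3: b"t3-updated", 5: b"t5-updated", 7: b"t7"}
sidefile = {3: b"t3-original", 5: b"t5-original"}

backup = merge_backup([1, 3, 5, 7], device, sidefile)
```

The resulting `backup` contains the original images of tracks 3 and 5 together with the unaltered tracks 1 and 7.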

Referring now to Figure 4, there is depicted a high
level logic flowchart which illustrates the initialization of
a process for creating a time zero backup copy, in accordance
with the method and system of the present invention.
As illustrated, this process starts at block 81
and thereafter passes to block 83 which illustrates the
beginning of the initialization process. Thereafter, the
process passes to block 85 which depicts the sorting of
the datasets by access path, down to DASD track granularity.
This sorting process will, necessarily, resolve an
identification of the DASD volumes within which the datasets
reside and the identification of the storage control
units to which those volumes belong.

Next, as depicted at block 87, a session identification
is established between each processor and the relevant
external storage control units. The session identification
is preferably unique across all storage control
units, in order that multiple processors will not interfere
with each other's backup copy processes. Thereafter,
as illustrated at block 89, a session bit map is established
which may be utilized, as set forth in detail herein
and within the cross-referenced patent application, to
indicate whether or not a particular track is part of the
present copy session. Thereafter, as depicted at block
91, the "logically complete" signal is sent to the invoking
process, indicating that application processing may continue;
however, slight delays in updates will occur until
such time as the backup copy is physically complete.
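
By way of illustration only, the initialization steps of blocks 85 through 91 can be sketched as follows. The (control unit, volume, track) tuple, the function name `initialize_session` and the process-local counter used for session identifications are assumptions made for this sketch; in particular, the counter is unique only within one process, whereas the source prefers uniqueness across all storage control units.

```python
import itertools
from typing import Dict, List, Set, Tuple

Extent = Tuple[str, str, int]  # (control unit, volume, track); hypothetical access path

_session_ids = itertools.count(1)  # process-local stand-in for a unique session id


def initialize_session(datasets: Dict[str, List[Extent]]):
    # Block 85: sort all extents by access path, down to track granularity.
    ordered = sorted(e for extents in datasets.values() for e in extents)
    # Block 87: establish a session identification.
    session_id = next(_session_ids)
    # Block 89: build a session bit map per control unit, modeled here
    # as the set of (volume, track) pairs that belong to the session.
    bitmap: Dict[str, Set[Tuple[str, int]]] = {}
    for control_unit, volume, track in ordered:
        bitmap.setdefault(control_unit, set()).add((volume, track))
    # Block 91: signal logical completion to the invoking process.
    return session_id, ordered, bitmap, "logically complete"


sid, order, bitmap, signal = initialize_session({
    "PAYROLL": [("CU1", "VOL2", 7), ("CU1", "VOL1", 3)],
    "LEDGER": [("CU2", "VOL9", 1)],
})
```

Sorting the tuples resolves, as a side effect, which volumes and which control units are involved, which is what the per-control-unit bit maps are keyed on.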

With reference now to Figure 5, there is depicted a
high level logic flowchart which illustrates the backup
copying of a dataset in accordance with the method and
system of the present invention. As illustrated, the process
begins at block 99 and thereafter passes to block
101. Block 101 depicts the beginning of the reading of
a backup copy. The process then passes to block 103
which illustrates a determination of whether or not an
update has occurred. In the event no update has occurred,
the process merely iterates until such time as an
update does occur. In the event an update has occurred,
the process passes to block 105. Block 105 illustrates
a determination of whether or not the update initiated by
an application within the data processing system is an
update against a portion of the time zero dataset. If not,
the process merely passes to block 107 and the update
is processed in a normal fashion. However, in the event
the update is against a portion of the time zero dataset,
the process passes to block 109.

Block 109 illustrates a determination of whether or
not the update is against a copied or uncopied portion
of the time zero dataset. That is, the update may be against
a portion of data within the dataset which has been copied
to the backup copy and is therefore physically complete,
or which exists in a sidefile, or against a portion which has
not yet been copied to the backup copy. If the portion of the
dataset against which the update is initiated has already been
copied to the backup copy or resides in a sidefile, the
process passes to block 107 which illustrates the
processing of the update. Again, the process then passes
from block 107 to block 103, to await the occurrence
of the next update.
Referring again to block 109, in the event the update
against the time zero dataset is initiated against a portion
of the time zero dataset which has not yet been copied
to the backup copy, the process passes to block 113.
Block 113 illustrates the temporary deferring or buffering
of the update and the copying of the affected portion of
the time zero dataset to a sidefile within memory 66 within
storage control unit 65 (see Figure 3). Thereafter, the
process passes to block 115, which illustrates the marking
of the primary session bit map, indicating to resource
manager 63 that this portion of the dataset has been
updated within the external storage subsystem and that
the time zero copy of this portion of the dataset is now
either within memory 66 within storage control unit 65
or within temporary host sidefile 71 which is utilized to
prevent overflow of data within memory 66 within storage
control unit 65 (see Figure 3).

After marking the primary session bit map, the process
passes to block 117 which illustrates the processing
of that update. Thereafter, the process passes to block 
119 which depicts a determination of whether or not the 
sidefile threshold within the memory of storage control 
unit 65 has been exceeded. If so, the process passes 
to block 121, which illustrates the generation of an at- 
tention signal, indicating that sidefiles within memory 66 
in storage control unit 65 are ready to be copied by the 
processor. Of course, those skilled in the art will appre- 
ciate that a failure to copy data from memory 66 within 
storage control unit 65 may result in the corruption of 
the backup copy if that memory 66 is overwritten. Re- 
ferring again to block 119, in the event the sidefile 
threshold has not been exceeded, the process returns 
again to block 103 to await the occurrence of the next 
update. 
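
By way of illustration only, the threshold test of blocks 119 and 121 can be sketched as follows. The class `SidefileMemory`, the measurement of occupancy as a count of tracks and the wording of the attention message are assumptions made for this sketch.

```python
from typing import Dict, List


class SidefileMemory:
    """Toy model of the sidefile area within memory 66."""

    def __init__(self, threshold: int) -> None:
        self.threshold = threshold            # hypothetical limit, in tracks
        self.tracks: Dict[int, bytes] = {}
        self.attentions: List[str] = []       # attention signals raised so far

    def add(self, track: int, image: bytes) -> None:
        self.tracks[track] = image
        # Block 119: has the sidefile threshold been exceeded?
        if len(self.tracks) > self.threshold:
            # Block 121: tell the processor that sidefiles are ready to copy.
            self.attentions.append(
                f"attention: {len(self.tracks)} sidefile tracks pending")

    def drain(self) -> Dict[int, bytes]:
        """Hand all sidefile tracks to the host and empty the area."""
        drained, self.tracks = self.tracks, {}
        return drained


mem = SidefileMemory(threshold=2)
mem.add(3, b"t3-original")
mem.add(5, b"t5-original")
signals_before = len(mem.attentions)   # threshold not yet exceeded
mem.add(9, b"t9-original")             # third track exceeds the threshold
copied = mem.drain()
```

Draining the area before it is overwritten is what protects the backup copy from the corruption mentioned above.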

The asynchronous copying of sidefile data from 
memory 66 within storage control unit 65 to a temporary 
host sidefile, or to the merged backup copy, is described 
in detail within the cross-referenced patent application, 
as well as the process by which merged copies are cre- 
ated which incorporate data read directly from tracked 
cyclic storage device 61, data within memory 66 within
storage control unit 65 and/or data within temporary host
sidefile 71.

Referring now to Figure 6, there is depicted a high 
level logic flowchart which illustrates automatic sidefile 
polling in accordance with the method and system of the 
present invention. As will be appreciated upon reference 
to the foregoing, if the level of data within subsystem 
memory exceeds one or more selected threshold levels, 
attention signals may be automatically transmitted to the 
data processing system, indicating the data within sub- 
system memory must be transferred from that location 
to alternate storage prior to the overwriting of data within 
subsystem memory and a resultant corruption of the 
backup copy data contained therein. 

Those skilled in the art will appreciate that such a
system, incorporating so-called "attention" signals, will
provide a monitor system which may be utilized to notify
the data processing system when data must be copied;
however, the efficiency of the backup copy process may
be greatly enhanced by providing a sidefile status query
which may be transmitted to storage system control unit
65 (see Figure 3) as a selected Channel Command Word
(CCW). Further, as will be explained in greater detail below,
the sidefile status query may be periodically appended
to a data retrieval command sequence, eliminating
the communications overhead which might otherwise
be necessary to establish communication between
the data processing system and the storage subsystem.
The sidefile status query may then be utilized
to determine if data is present within the subsystem
memory, allowing the data processing system to selectively
access and copy that data during periods of low
channel utilization, greatly enhancing the efficiency of
the backup copy process.

The automatic sidefile polling method of the present
invention begins, as illustrated in Figure 6, at block 131
and thereafter passes to block 133. Block 133 illustrates
a determination of whether or not a sidefile status query
has been received and if so, the process passes to block
135. Block 135 illustrates the transmittal of sidefile status
to the data processing system and if the sidefile area
within subsystem memory is not empty, a data retrieval
command may be issued to read the sidefile data from
the subsystem memory within the storage subsystem
control unit to the data processing system. Thereafter,
the process will return iteratively to block 133.

Referring again to block 133, in the event a sidefile
status query is not received, the process passes to block
137. Block 137 illustrates a determination of whether or
not a data retrieval command sequence has been received.
If not, the process again returns iteratively to
block 133 to await the arrival of a sidefile status query
message or a data retrieval command sequence or other
appropriate message. In the event a data retrieval
command sequence has been received the process
passes to block 139.

Block 139 illustrates the retrieval of the requested
data and the transmittal of that data to the data processing
system. Thereafter, as above, the process passes
to block 141 which illustrates a determination of whether
or not a sidefile status query was appended to the received
data retrieval command sequence and, if so, the
process passes to block 143. As above, block 143 illustrates
the transmittal of the sidefile status to the data
processing system. Referring again to block 141, in the
event a sidefile status query has not been appended to
the data retrieval command sequence, or after transmitting
the sidefile status to the data processing system, in
the event a sidefile status query was appended, the
process returns to block 133 to await the arrival of a sidefile
status query, a data retrieval command sequence,
or other appropriate command.
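
By way of illustration only, the dispatch logic of Figure 6 can be sketched as follows. The `Command` and `Response` records, the function name `handle` and the reporting of sidefile status as a count of pending tracks are assumptions made for this sketch; the source does not specify the format of the status returned.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Command:
    read_tracks: List[int] = field(default_factory=list)  # data retrieval part, may be empty
    query_sidefile: bool = False                          # standalone or appended status query


@dataclass
class Response:
    data: Dict[int, bytes] = field(default_factory=dict)
    sidefile_pending: Optional[int] = None  # None when no status was requested


def handle(cmd: Command,
           device: Dict[int, bytes],
           sidefile: Dict[int, bytes]) -> Response:
    resp = Response()
    if cmd.read_tracks:
        # Blocks 137 and 139: retrieve and transmit the requested data.
        resp.data = {t: device[t] for t in cmd.read_tracks}
    if cmd.query_sidefile:
        # Blocks 133/135 (standalone) or 141/143 (appended): report status
        # in the same exchange, with no extra round trip.
        resp.sidefile_pending = len(sidefile)
    return resp


device = {1: b"t1", 2: b"t2"}
sidefile = {3: b"t3-original"}

plain = handle(Command(read_tracks=[1]), device, sidefile)
piggybacked = handle(Command(read_tracks=[2], query_sidefile=True), device, sidefile)
standalone = handle(Command(query_sidefile=True), device, sidefile)
```

The piggybacked case returns both the requested track and the sidefile status in one response, which is the saving in communications overhead that the description attributes to appending the query.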

Upon reference to the foregoing those skilled in the
art will appreciate that the method and system of the
present invention provide an efficient method whereby
the status of sidefile copies of the affected portions of
the designated datasets may be determined by the data
processing system, so that data may be transmitted to
the data processing system in a manner which greatly
enhances the efficiency of the backup copy process.



Claims 

1. A method in a data processing system for enhanced
efficiency of backup copying of designated datasets
stored within a storage subsystem (65,66,61) comprising
a plurality of storage devices (61) which are
coupled to said data processing system via a storage
control unit (65) having subsystem memory
(66) therein, during application execution within
said data processing system, said method comprising
the steps of:

a) suspending application execution within said
data processing system, forming a dataset
logical-to-physical system address concordance
for said designated datasets to be utilized to
administer copying of said designated datasets
and resuming application execution thereafter;

b) processing at said storage subsystem
(65,66,61) any application initiated update to
uncopied portions of said designated datasets
by temporarily deferring said updates, writing
sidefiles of said designated datasets or portions
thereof affected by said update to said
subsystem memory (66) and thereafter writing
said updates to said storage subsystem;

c) accessing and copying said designated datasets
within said storage subsystem
(65,66,61) on a scheduled or opportunistic basis
by issuing data retrieval command sequences
from said data processing system to said
storage subsystem (65,66,61);

d) periodically issuing a sidefile status query or
periodically appending said sidefile status query
to a data retrieval command sequence, from
said data processing system to said storage
subsystem (65,66,61), allowing the determination
of data presence within said subsystem
memory (66); and

e) selectively accessing and copying said sidefiles
in response to a determination of data
presence within said subsystem memory (66);

f) writing said copied designated datasets and
sidefiles to an alternate storage location in a
backup copy order specified by said address
concordance.

2. A data processing system for enhanced efficiency
of backup copying, during application execution, of
designated datasets stored within a storage subsystem
(65,66,61) comprising a plurality of storage
devices (61) which are coupled to said data
processing system via a storage control unit (65)
having subsystem memory (66) therein, said data
processing system comprising:

a) means for temporarily suspending application
execution within said data processing system
prior to forming a dataset logical-to-physical
system address concordance for said designated
datasets to be utilized to administer
copying of said designated datasets and
means for resuming application execution
thereafter;

b) means for processing at said storage subsystem
(65,66,61) any application initiated update
to uncopied portions of said designated
datasets by temporarily deferring said updates,
writing sidefiles of said designated datasets or
portions thereof affected by said update to said
subsystem memory (66) and thereafter writing
said updates to said storage subsystem;

c) means for accessing and copying said designated
datasets within said storage subsystem
(65,66,61) on a scheduled or opportunistic basis
by issuing data retrieval command sequences
from said data processing system to said
storage subsystem (65,66,61);

d) means for periodically issuing a sidefile status
query or periodically appending said sidefile
status query to a data retrieval command sequence,
from said data processing system to
said storage subsystem (65,66,61), allowing
the determination of data presence within said
subsystem memory (66); and

e) means for selectively accessing and copying
said sidefiles in response to a determination of
data presence within said subsystem memory
(66);

f) means for writing said copied designated datasets
and sidefiles to an alternate storage location
in a backup copy order defined by said
address concordance.



Claims (English rendering of the German Patentansprüche)

1. A method in a data processing system for improved
efficiency of backup copying of designated datasets
during application execution in the data processing
system, the datasets being stored in a storage
subsystem (65, 66, 61) comprising a plurality of storage
units (61) which are connected to the data processing
system via a storage control unit (65) having a
subsystem memory (66) contained therein, the method
comprising the following steps:

a) suspending application execution in the data
processing system, forming a logical-to-physical
system address concordance of the dataset for the
designated datasets, to be used to administer the
copying of the designated datasets, and subsequently
resuming application execution;

b) processing, in the storage subsystem (65, 66, 61),
each application initiated update to uncopied
portions of the designated datasets by temporarily
deferring the updates, writing sidefiles of the
designated datasets, or of portions thereof affected
by the update, to the subsystem memory (66), and
subsequently writing the updates to the storage
subsystem;

c) accessing and copying the designated datasets in
the storage subsystem (65, 66, 61) on a scheduled or
opportunistic basis by issuing data retrieval command
sequences from the data processing system to the
storage subsystem (65, 66, 61);

d) periodically issuing a sidefile status query, or
periodically appending the sidefile status query to a
data retrieval command sequence, from the data
processing system to the storage subsystem
(65, 66, 61), whereby the presence of data in the
subsystem memory (66) can be determined; and

e) selectively accessing and copying the sidefiles
upon a determination of the presence of data in the
subsystem memory (66);

f) writing the copied designated datasets and
sidefiles to another storage location in a backup
copy order specified by the address concordance.

2. A data processing system for improved efficiency of
backup copying of designated datasets during
application execution, the datasets being stored in a
storage subsystem (65, 66, 61) comprising a plurality of
storage units (61) which are connected to the data
processing system via a storage control unit (65) having
the subsystem memory (66) contained therein, the data
processing system comprising:

a) means for temporarily suspending application
execution in the data processing system prior to the
forming of a logical-to-physical system address
concordance of the dataset for the designated
datasets, to be used to administer the copying of the
designated datasets, and means for subsequently
resuming application execution;

b) means for processing, in the storage subsystem
(65, 66, 61), each application initiated update to
uncopied portions of the designated datasets by
temporarily deferring the updates, for writing
sidefiles of the designated datasets, or of portions
thereof affected by the update, to the subsystem
memory (66), and for subsequently writing the updates
to the storage subsystem;

c) means for accessing and copying the designated
datasets in the storage subsystem (65, 66, 61) on a
scheduled or opportunistic basis by issuing data
retrieval command sequences from the data processing
system to the storage subsystem (65, 66, 61);

d) means for periodically issuing a sidefile status
query, or for periodically appending the sidefile
status query to a data retrieval command sequence,
from the data processing system to the storage
subsystem (65, 66, 61), whereby the presence of data
in the subsystem memory (66) can be determined; and

e) means for selectively accessing and copying the
sidefiles upon a determination of the presence of
data in the subsystem memory (66);

f) means for writing the copied designated datasets
and sidefiles to another storage location in a backup
copy order defined by the address concordance.



Claims (English rendering of the French Revendications)

1. A method in a data processing system for improving
the efficiency of backup copying of designated datasets
stored in a storage subsystem (65, 66, 61) consisting of
a plurality of storage devices (61) coupled to said data
processing system via a storage control unit (65) having
a subsystem memory (66) therein, during execution of an
application in said data processing system, said method
comprising the steps of:

a) suspending execution of the application in said
data processing system, forming a dataset
logical-to-physical system address concordance for
said designated datasets, to be used to administer
the copying of said designated datasets, and
thereafter resuming execution of the application;

b) processing, in said storage subsystem
(65, 66, 61), any update initiated by the application
to uncopied portions of said designated datasets by
temporarily deferring said updates, writing sidefiles
of said designated datasets, or of portions thereof
affected by said update, to said subsystem memory
(66), and then writing said updates to said storage
subsystem;

c) accessing and copying said designated datasets in
said storage subsystem (65, 66, 61) on a scheduled or
opportunistic basis by issuing data retrieval command
sequences from said data processing system to said
storage subsystem (65, 66, 61);

d) periodically issuing a sidefile status query, or
periodically appending said sidefile status query to
a said data retrieval command sequence, from said
data processing system to said storage subsystem
(65, 66, 61), allowing the determination of the
presence of data in said subsystem memory (66); and

e) selectively accessing and copying said sidefiles
in response to the determination of the presence of
data in said subsystem memory (66);

f) writing said copied designated datasets and the
sidefiles to an alternate storage location in a
backup copy order specified by said address
concordance.

2. A data processing system for improving the efficiency
of backup copying, during execution of an application,
of designated datasets stored in a storage subsystem
(65, 66, 61) comprising a plurality of storage devices
(61) coupled to said data processing system via a
storage control unit (65) having a subsystem memory (66)
therein, said data processing system comprising:

a) means for temporarily suspending execution of the
application in said data processing system before
forming a dataset logical-to-physical address
concordance for said designated datasets, to be used
to administer the copying of said designated
datasets, and means for thereafter resuming execution
of the application;

b) means for processing, at said storage subsystem
(65, 66, 61), any update initiated by the application
to uncopied portions of said designated datasets by
temporarily deferring said updates, writing sidefiles
of said designated datasets, or of portions thereof
affected by said update, to said subsystem memory
(66), and then writing said updates to said storage
subsystem;

c) means for accessing and copying said designated
datasets in said storage subsystem (65, 66, 61) on a
scheduled or opportunistic basis by issuing data
retrieval command sequences from said data processing
system to said storage subsystem (65, 66, 61);

d) means for periodically issuing a sidefile status
query, or periodically appending said sidefile status
query to a data retrieval command sequence, from said
data processing system to said storage subsystem
(65, 66, 61), allowing the determination of the
presence of data in said subsystem memory (66); and

e) means for selectively accessing and copying said
sidefiles in response to the determination of the
presence of data in said subsystem memory (66);

f) means for writing said copied designated datasets
and the sidefiles to an alternate storage location in
a backup copy order defined by said address
concordance.